# Meeting Scenario Optimization
Vad
MIT
A voice activity detection (VAD) model based on pyannote.audio that identifies active speech segments in audio.
Speech Recognition
salmanshahid
1,794
1
Speaker Diarization 3.1
MIT
An audio processing model for speaker diarization and embedding, supporting automatic voice activity detection and overlapping speech detection.
Audio Processing
tensorlake
393
2
Belle Whisper Large V3 Zh
Apache-2.0
A Chinese speech recognition model fine-tuned from whisper-large-v3, showing significant performance improvements on multiple Chinese speech benchmarks.
Speech Recognition
Transformers
BELLE-2
1,666
112